EP 0 399 666 B1

Description 

The present invention relates to fusion polypeptides where two individual polypeptides or parts thereof are
fused to form a single amino acid chain. Such fusion may arise from the expression of a single continuous
coding sequence formed by recombinant DNA techniques.

Fusion polypeptides are known, for example those where a polypeptide which is the ultimately desired 
product of the process is expressed with an N-terminal "leader sequence" which encourages or allows secre- 
tion of the polypeptide from the cell. An example is disclosed in EP-A-116 201 (Chiron). 

Human serum albumin (HSA) is a known protein found in the blood. EP-A-147 198 (Delta Biotechnology)
discloses its expression in a transformed host, in this case yeast. Our earlier application EP-A-322 094
discloses N-terminal fragments of HSA, namely those consisting of residues 1-n where n is 369 to 419, which
have therapeutic utility. The application also mentions the possibility of fusing the C-terminal residue of such
molecules to other, unnamed, polypeptides.

One aspect of the present invention provides a fusion polypeptide comprising, as at least part of the
N-terminal portion thereof, an N-terminal portion of HSA or a variant thereof and, as at least part of the C-terminal
portion thereof, another polypeptide except that, when the said N-terminal portion of HSA is the 1-n portion
where n is 369 to 419 or a variant thereof then the said polypeptide is (a) the 585 to 1578 portion of human
fibronectin or a variant thereof, (b) the 1 to 368 portion of CD4 or a variant thereof, (c) platelet derived growth
factor, or a variant thereof, (d) transforming growth factor β, or a variant thereof, (e) the 1-261 portion of mature
human plasma fibronectin or a variant thereof, (f) the 278-578 portion of mature human plasma fibronectin or
a variant thereof, (g) the 1-272 portion of mature human von Willebrand's Factor or a variant thereof, or (h)
alpha-1-antitrypsin or a variant thereof.

The N-terminal portion of HSA is preferably the said 1-n portion, the 1-177 portion (up to and including 
the cysteine), the 1-200 portion (up to but excluding the cysteine) or a portion intermediate 1-177 and 1-200. 

The term "human serum albumin" (HSA) is intended to include (but not necessarily to be restricted to)
known or yet-to-be-discovered polymorphic forms of HSA. For example, albumin Naskapi has Lys-372 in place
of Glu-372 and pro-albumin Christchurch has an altered pro-sequence. The term "variants" is intended to
include (but not necessarily to be restricted to) minor artificial variations in sequence (such as molecules lacking
one or a few residues, having conservative substitutions or minor insertions of residues, or having minor
variations of amino acid structure). Thus polypeptides which have 80%, preferably 85%, 90%, 95% or 99%,
homology with HSA are deemed to be "variants". It is also preferred for such variants to be physiologically equivalent
to HSA; that is to say, variants preferably share at least one pharmacological utility with HSA. Furthermore,
any putative variant which is to be used pharmacologically should be non-immunogenic in the animal
(especially human) being treated.
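
The percentage criterion above can be illustrated with a short calculation. The following is a minimal sketch only, assuming that "homology" is read as percent identity between two sequences that have already been aligned to equal length. The function names, the gap convention ("-") and the substituted example peptides are illustrative assumptions and are not taken from the specification; the reference peptide EEPQNLIK is the HSA 382-389 stretch encoded by Linker 3.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Assumption: the inputs are pre-aligned, equal-length strings in which
    '-' marks a gap. Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_variant_threshold(reference: str, candidate: str,
                            threshold: float = 80.0) -> bool:
    """True if the candidate reaches the stated identity threshold (80% by default)."""
    return percent_identity(reference, candidate) >= threshold


if __name__ == "__main__":
    reference = "EEPQNLIK"       # HSA residues 382-389
    conservative = "EEPQNLIR"    # hypothetical Lys -> Arg substitution
    divergent = "EDPQNVIR"       # hypothetical, three substitutions
    print(percent_identity(reference, conservative))
    print(meets_variant_threshold(reference, divergent))
```

On a real sequence the alignment step itself would come from a dedicated alignment tool; only the counting is sketched here.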

Conservative substitutions are those where one or more amino acids are substituted for others having
similar properties such that one skilled in the art of polypeptide chemistry would expect at least the secondary
structure, and preferably the tertiary structure, of the polypeptide to be substantially unchanged. For example,
typical such substitutions include asparagine for glutamine, serine for asparagine and arginine for lysine.
Variants may alternatively, or as well, lack up to ten (preferably only one or two) intermediate amino acid residues
(i.e. not at the termini of the said N-terminal portion of HSA) in comparison with the corresponding portion of
natural HSA; preferably any such omissions occur in the 100 to 369 portion of the molecule (relative to mature
HSA itself) (if present). Similarly, up to ten, but preferably only one or two, amino acids may be added, again
in the 100 to 369 portion for preference (if present). The term "physiologically functional equivalents" also
encompasses larger molecules comprising the said sequence plus a further sequence at the N-terminal (for
example, pro-HSA, pre-pro-HSA and met-HSA).

Clearly, the said "another polypeptide" in the fusion compounds of the invention cannot be the remaining 
portion of HSA, since otherwise the whole polypeptide would be HSA, which would not then be a "fusion poly- 
peptide". 

Even when the HSA-like portion is not the said 1-n portion of HSA, it is preferred for the non-HSA portion
to be one of the said (a) to (h) entities.

The 1 to 368 portion of CD4 represents the first four disulphide-linked immunoglobulin-like domains of the 
human T lymphocyte CD4 protein, the gene for and amino acid sequence of which are disclosed in D. Smith 
et al (1987) Science 328, 1704-1707. It is used to combat HIV infections. 

The sequence of human platelet-derived growth factor (PDGF) is described in Collins et al (1985) Nature
316, 748-750. Similarly, the sequence of transforming growth factor β (TGF-β) is described in Derynck et al
(1985) Nature 316, 701-705. These growth factors are useful for wound-healing.

A cDNA sequence for the 1-261 portion of Fn was disclosed in EP-A-207 751 (obtained from plasmid pFH6
with endonuclease PvuII). This portion binds fibrin and can be used to direct fused compounds to blood clots.

A cDNA sequence for the 278-578 portion of Fn, which contains a collagen-binding domain, was disclosed
by R.J. Owens and F.E. Baralle in 1986 E.M.B.O.J. 5, 2825-2830. This portion will bind to platelets.

The 1-272 portion of von Willebrand's Factor binds and stabilises factor VIII. The sequence is given in
Bonthron et al, Nucl. Acids Res. 14, 7125-7127.

Variants of alpha-1-antitrypsin include those disclosed by Rosenburg et al (1984) Nature 312, 77-80. In
particular, the present invention includes the Pittsburgh variant (Met 358 is mutated to Arg) and the variant where
Pro 357 and Met 358 are mutated to alanine and arginine respectively. These compounds are useful in the
treatment of septic shock and lung disorders.

Variants of the non-HSA portion of the polypeptides of the invention include variations as discussed above
in relation to the HSA portion, including those with conservative amino acid substitutions, and also homologues
from other species.

The fusion polypeptides of the invention may have N-terminal amino acids which extend beyond the
portion corresponding to the N-terminal portion of HSA. For example, if the HSA-like portion corresponds to an
N-terminal portion of mature HSA, then pre-, pro-, or pre-pro sequences may be added thereto, for example
the yeast alpha-factor leader sequence. The fused leader portions of WO 90/01063 may be used. The
polypeptide which is fused to the HSA portion may be a naturally-occurring polypeptide, a fragment thereof or a
novel polypeptide, including a fusion polypeptide. For example, in Example 3 below, a fragment of fibronectin
is fused to the HSA portion via a 4 amino acid linker.

It has been found that the amino terminal portion of the HSA molecule is so structured as to favour
particularly efficient translocation and export of the fusion compounds of the invention in eukaryotic cells.

A second aspect of the invention provides a transformed host having a nucleotide sequence so arranged
as to express a fusion polypeptide as described above. By "so arranged", we mean, for example, that the
nucleotide sequence is in correct reading frame with an appropriate RNA polymerase binding site and translation
start sequence and is under the control of a suitable promoter. The promoter may be homologous with or
heterologous to the host. Downstream (3') regulatory sequences may be included if desired, as is known. The
host is preferably yeast (for example Saccharomyces spp., e.g. S. cerevisiae; Kluyveromyces spp., e.g. K. lactis;
Pichia spp.; or Schizosaccharomyces spp., e.g. S. pombe) but may be any other suitable host such as E.
coli, B. subtilis, Aspergillus spp., mammalian cells, plant cells or insect cells.

A third aspect of the invention provides a process for preparing a fusion polypeptide according to the first
aspect of the invention by cultivation of a transformed host according to the second aspect of the invention,
followed by separation of the fusion polypeptide in a useful form.

A fourth aspect of the invention provides therapeutic methods of treatment of the human or other animal 
body comprising administration of such a fusion polypeptide. 

In the methods of the invention we are particularly concerned to improve the efficiency of secretion of
useful therapeutic human proteins from yeast and have conceived the idea of fusing to amino-terminal portions
of HSA those proteins which may ordinarily be only inefficiently secreted. One such protein is a potentially
valuable wound-healing polypeptide representing amino acids 585 to 1578 of human fibronectin (referred to
herein as Fn 585-1578). As we have described in a separate application (filed simultaneously herewith) this
molecule contains cell spreading, chemotactic and chemokinetic activities useful in healing wounds. The fusion
polypeptides of the present invention wherein the C-terminal portion is Fn 585-1578 can be used for wound
healing applications as biosynthesised, especially where the hybrid human protein will be topically applied.
However, the portion representing amino acids 585 to 1578 of human fibronectin can if desired be recovered
from the fusion protein by preceding the first amino acid of the fibronectin portion by amino acids comprising
a factor X cleavage site. After isolation of the fusion protein from culture supernatant, the desired molecule is
released by factor X cleavage and purified by suitable chromatography (e.g. ion-exchange chromatography).
Other sites providing for enzymatic or chemical cleavage can be provided, either by appropriate juxtaposition
of the N-terminal and C-terminal portions or by the insertion therebetween of an appropriate linker.

At least some of the fusion polypeptides of the invention, especially those including the said CD4 and vWF
fragments, PDGF and α1AT, also have an increased half-life in the blood and therefore have advantages and
therapeutic utilities themselves, namely the therapeutic utility of the non-HSA portion of the molecule. In the
case of α1AT and others, the compound will normally be administered as a one-off dose or only a few doses
over a short period, rather than over a long period, and therefore the compounds are less likely to cause an
immune response.

EXAMPLES : SUMMARY

Standard recombinant DNA procedures were as described by Maniatis et al (1982 and recent 2nd edition)
unless otherwise stated. Construction and analysis of phage M13 recombinant clones was as described by
Messing (1983) and Sanger et al (1977).

DNA sequences encoding portions of human serum albumin used in the construction of the following
molecules are derived from the plasmids mHOB12 and pDBD2 (EP-A-322 094, Delta Biotechnology Ltd, relevant
portions of which are reproduced below) or by synthesis of oligonucleotides equivalent to parts of this
sequence. DNA sequences encoding portions of human fibronectin are derived from the plasmid pFHDELI, or
by synthesis of oligonucleotides equivalent to parts of this sequence. Plasmid pFHDELI, which contains the
complete human cDNA encoding plasma fibronectin, was obtained by ligation of DNA derived from plasmids
pFH6, 16, 54, 154 and 1 (EP-A-207 751; Delta Biotechnology Ltd).

This DNA represents an mRNA variant which does not contain the 'ED' sequence and has an 89-amino
acid variant of the III-CS region (R.J. Owens, A.R. Kornblihtt and F.E. Baralle (1986) Oxford Surveys on
Eukaryotic Genes 3 141-160). The map of this vector is disclosed in Fig. 11 and the protein sequence of the mature
polypeptide produced by expression of this cDNA is shown in Fig. 5.

Oligonucleotides were synthesised on an Applied Biosystems 380B oligonucleotide synthesiser according 
to the manufacturer's recommendations (Applied Biosystems, Warrington, Cheshire, UK). 

An expression vector was constructed in which DNA encoding the HSA secretion signal and mature HSA
up to and including the 387th amino acid, leucine, fused in frame to DNA encoding a segment of human
fibronectin representing amino acids 585 to 1578 inclusive, was placed downstream of the hybrid promoter of
EP-A-258 067 (Delta Biotechnology), which is a highly efficient galactose-inducible promoter functional in
Saccharomyces cerevisiae. The codon for the 1578th amino acid of human fibronectin was directly followed by a
stop codon (TAA) and then the S. cerevisiae phosphoglycerate kinase (PGK) gene transcription terminator.
This vector was then introduced into S. cerevisiae by transformation, wherein it directed the expression and
secretion from the cells of a hybrid molecule representing the N-terminal 387 amino acids of HSA C-terminally
fused to amino acids 585 to 1578 of human fibronectin.

In a second example a similar vector is constructed so as to enable secretion by S. cerevisiae of a hybrid
molecule representing the N-terminal 195 amino acids of HSA C-terminally fused to amino acids 585 to 1578
of human fibronectin.
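
The residue ranges quoted for these hybrid molecules are inclusive, so the expected chain lengths follow from simple arithmetic. The sketch below is illustrative only: it assumes that the secretion signal is removed and does not count towards the secreted product, and the function names and the optional linker parameter are hypothetical conveniences, not part of the described constructs.

```python
def residue_count(first: int, last: int) -> int:
    """Number of residues in an inclusive range first..last."""
    if last < first:
        raise ValueError("last must not precede first")
    return last - first + 1


def fusion_length(hsa_last: int, fn_first: int = 585, fn_last: int = 1578,
                  linker: int = 0) -> int:
    """Length of HSA(1..hsa_last) + optional linker + Fn(fn_first..fn_last)."""
    return residue_count(1, hsa_last) + linker + residue_count(fn_first, fn_last)


if __name__ == "__main__":
    print("Fn 585-1578:", residue_count(585, 1578), "residues")
    print("HSA 1-387 fused to Fn 585-1578:", fusion_length(387), "residues")
    print("HSA 1-195 fused to Fn 585-1578:", fusion_length(195), "residues")
```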

Aspects of the present invention will now be described by way of example and with reference to the ac- 
companying drawings, in which: 

Figure 1 (on two sheets) depicts the amino acid sequence currently thought to be the most representative
of natural HSA, with (boxed) the alternative C-termini of HSA(1-n);

Figure 2 (on two sheets) depicts the DNA sequence coding for mature HSA, wherein the sequence included
in Linker 3 is underlined;

Figure 3 illustrates, diagrammatically, the construction of mHOB16;

Figure 4 illustrates, diagrammatically, the construction of pHOB31;

Figure 5 (on 6 sheets) illustrates the mature protein sequence encoded by the Fn plasmid pFHDELI;

Figure 6 illustrates Linker 5, showing the eight constituent oligonucleotides;

Figure 7 shows schematically the construction of plasmid pDBDF2;

Figure 8 shows schematically the construction of plasmid pDBDF5;

Figure 9 shows schematically the construction of plasmid pDBDF9;

Figure 10 shows schematically the construction of plasmid pDBDF12, using plasmid pFHDELI; and

Figure 11 shows a map of plasmid pFHDELI.

EXAMPLE 1 : HSA 1-387 FUSED TO Fn 585-1578 

The following is an account of a preparation of plasmids comprising sequences encoding a portion of HSA,
as is disclosed in EP-A-322 094.

The human serum albumin coding sequence used in the construction of the following molecules is derived
from the plasmid M13mp19.7 (EP-A-201 239, Delta Biotechnology Ltd.) or by synthesis of oligonucleotides
equivalent to parts of this sequence. Oligonucleotides were synthesised using phosphoramidite chemistry on
an Applied Biosystems 380B oligonucleotide synthesizer according to the manufacturer's recommendations
(AB Inc., Warrington, Cheshire, England).

An oligonucleotide was synthesised (Linker 1) which represented a part of the known HSA coding
sequence (Figure 2) from the PstI site (1235-1240, Figure 2) to the codon for valine 381 wherein that codon was
changed from GTG to GTC:

Linker 1

       D   P   H   E   C   Y   A   K   V   F   D   E   F   K   P   L   V
5'    GAT CCT CAT GAA TGC TAT GCC AAA GTG TTC GAT GAA TTT AAA CCT CTT GTC 3'
3' ACGT CTA GGA GTA CTT ACG ATA CGG TTT CAC AAG CTA CTT AAA TTT GGA GAA CAG 5'

Linker 1 was ligated into the vector M13mp19 (Norrander et al, 1983) which had been digested with PstI
and HincII and the ligation mixture was used to transfect E.coli strain XL1-Blue (Stratagene Cloning Systems,
San Diego, CA). Recombinant clones were identified by their failure to evolve a blue colour on medium
containing the chromogenic indicator X-gal (5-bromo-4-chloro-3-indolyl-β-D-galactoside) in the presence of IPTG
(isopropylthio-β-galactoside). DNA sequence analysis of template DNA prepared from bacteriophage particles
of recombinant clones identified a molecule with the required DNA sequence, designated mHOB12 (Figure 3).

M13mp19.7 consists of the coding region of mature HSA in M13mp19 (Norrander et al, 1983) such that
the codon for the first amino acid of HSA, GAT, overlaps a unique XhoI site thus:

          Asp Ala
   5' CTCGAGATGCA 3'
   3' GAGCTCTACGT 5'
      XhoI

(EP-A-201 239). M13mp19.7 was digested with XhoI and made flush-ended by S1-nuclease treatment and
was then ligated with the following oligonucleotide (Linker 2):

Linker 2

5' TCTTTTATCCAAGCTTGGATAAAAGA 3'
3' AGAAAATAGGTTCGAACCTATTTTCT 5'
             HindIII

The ligation mix was then used to transfect E.coli XL1-Blue and template DNA was prepared from several
plaques and then analysed by DNA sequencing to identify a clone, pDBD1 (Figure 4), with the correct
sequence.

A 1.1 kb HindIII to PstI fragment representing the 5' end of the HSA coding region and one half of the
inserted oligonucleotide linker was isolated from pDBD1 by agarose gel electrophoresis. This fragment was then
ligated with double stranded mHOB12 previously digested with HindIII and PstI and the ligation mix was then
used to transfect E.coli XL1-Blue. Single stranded template DNA was prepared from mature bacteriophage
particles of several plaques. The DNA was made double stranded in vitro by extension from annealed sequencing
primer with the Klenow fragment of DNA polymerase I in the presence of deoxynucleoside triphosphates.
Restriction enzyme analysis of this DNA permitted the identification of a clone with the correct configuration,
mHOB15 (Figure 4).

The following oligonucleotide (Linker 3) represents from the codon for the 382nd amino acid of mature HSA
(glutamate, GAA) to the codon for lysine 389 which is followed by a stop codon (TAA) and a HindIII site and
then a BamHI cohesive end:

Linker 3

      E   E   P   Q   N   L   I   K  Stop
5'   GAA GAG CCT CAG AAT TTA ATC AAA TAA GCTTG     3'
3'   CTT CTC GGA GTC TTA AAT TAG TTT ATT CGAACCTAG 5'

This was ligated into double stranded mHOB15, previously digested with HincII and BamHI. After ligation,
the DNA was digested with HincII to destroy all non-recombinant molecules and then used to transfect E.coli
XL1-Blue. Single stranded DNA was prepared from bacteriophage particles of a number of clones and
subjected to DNA sequence analysis. One clone having the correct DNA sequence was designated mHOB16
(Figure 4).
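
The design of Linker 3 can be checked computationally. The following sketch is offered as an illustration, not as part of the described procedure: it confirms that the two strands as printed are complementary, that the upper strand encodes residues 382-389 followed by a stop codon, that a HindIII recognition sequence (AAGCTT) is present, and that the unpaired bases of the lower strand read GATC, the overhang compatible with a BamHI end. The helper names and the returned dictionary layout are assumptions made for this example; the codon table is the standard genetic code.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq: str) -> str:
    """Base-by-base complement, without reversing the sequence."""
    return seq.translate(_COMPLEMENT)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Linker 3 as printed: upper strand written 5'->3', lower strand 3'->5'.
LINKER3_TOP = "GAAGAGCCTCAGAATTTAATCAAATAAGCTTG"
LINKER3_BOTTOM = "CTTCTCGGAGTCTTAAATTAGTTTATTCGAACCTAG"


def check_linker(top: str, bottom_3_to_5: str) -> dict:
    """Summarise pairing, coding capacity and cohesive end of a linker."""
    paired = complement(top)
    unpaired = bottom_3_to_5[len(top):]
    return {
        "strands_pair": bottom_3_to_5.startswith(paired),
        "peptide": translate(top).split("*")[0],   # residues before the stop
        "has_hindiii_site": "AAGCTT" in top,
        "overhang_5_to_3": unpaired[::-1],         # reverse to read 5'->3'
    }


if __name__ == "__main__":
    for key, value in check_linker(LINKER3_TOP, LINKER3_BOTTOM).items():
        print(key, "=", value)
```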

A molecule in which the mature HSA coding region was fused to the HSA secretion signal was created by
insertion of Linker 4 into BamHI and XhoI digested M13mp19.7 to form pDBD2 (Figure 4).

Linker 4

          S   A   Y   S   R   G   V   F
   ...   TCG GCT TAT TCC AGG GGT GTG TTT CG     3'
   ...   AGC CGA ATA AGG TCC CCA CAC AAA GCAGCT 5'

In this linker the codon for the fourth amino acid after the initial methionine, ACC for threonine in the HSA
pre-pro leader sequence (Lawn et al, 1981), has been changed to AGC for serine to create a HindIII site.

A sequence of synthetic DNA representing a part of the known HSA coding sequence (Lawn et al., 1981)
(amino acids 382 to 387, Fig. 2), fused to part of the known fibronectin coding sequence (Kornblihtt et al., 1985)
(amino acids 585 to 640, Fig. 2), was prepared by synthesising eight oligonucleotides (Linker 5, Fig. 6). The
oligonucleotides 2, 3, 4, 6, 7 and 8 were phosphorylated using T4 polynucleotide kinase and then the
oligonucleotides were annealed under standard conditions in pairs, i.e. 1+8, 2+7, 3+6 and 4+5. The annealed
oligonucleotides were then mixed together and ligated with mHOB12 which had previously been digested with the
restriction enzymes HincII and EcoRI. The ligation mixture was then used to transfect E.coli XL1-Blue
(Stratagene Cloning Systems, San Diego, CA). Single stranded template DNA was then prepared from mature
bacteriophage particles derived from several independent plaques and then was analysed by DNA sequencing.
A clone in which a linker of the expected sequence had been correctly inserted into the vector was designated
pDBDF1 (Fig. 7). This plasmid was then digested with PstI and EcoRI and the approx. 0.24kb fragment was
purified and then ligated with the 1.29kb BamHI-PstI fragment of pDBD2 (Fig. 7) and BamHI + EcoRI digested
pUC19 (Yanisch-Perron et al., 1985) to form pDBDF2 (Fig. 7).

A plasmid containing a DNA sequence encoding full length human fibronectin, pFHDELI, was digested
with EcoRI and XhoI and a 0.77kb EcoRI-XhoI fragment (Fig. 8) was isolated and then ligated with EcoRI and
SalI digested M13mp18 (Norrander et al, 1983) to form pDBDF3 (Fig. 8).

The following oligonucleotide linker (Linker 6) was synthesised, representing from the PstI site at
4784-4791 of the fibronectin sequence of EP-A-207 751 to the codon for tyrosine 1578 (Fig. 5) which is followed by
a stop codon (TAA), a HindIII site and then a BamHI cohesive end:

Linker 6

       G   P   D   Q   T   E   M   T   I   E   G   L
5'    GGT CCA GAT CAA ACA GAA ATG ACT ATT GAA GGC TTG
3' ACGT CCA GGT CTA GTT TGT CTT TAC TGA TAA CTT CCG AAC

       Q   P   T   V   E   Y  Stop
      CAG CCC ACA GTG GAG TAT TAA
      GTC GGG TGT CAC CTC ATA ATT

This linker was then ligated with PstI and HindIII digested pDBDF3 to form pDBDF4 (Fig. 8). The following
DNA fragments were then ligated together with BglII digested pKV50 (EP-A-258 067) as shown in Fig. 8: 0.68kb
EcoRI-BamHI fragment of pDBDF4, 1.5kb BamHI-StuI fragment of pDBDF2 and the 2.2kb StuI-EcoRI fragment
of pFHDELI. The resultant plasmid pDBDF5 (Fig. 8) includes the promoter of EP-A-258 067 to direct the
expression of the HSA secretion signal fused to DNA encoding amino acids 1-387 of mature HSA, in turn fused
directly and in frame with DNA encoding amino acids 585-1578 of human fibronectin, after which translation
would terminate at the stop codon TAA. This is then followed by the S.cerevisiae PGK gene transcription
terminator. The plasmid also contains sequences which permit selection and maintenance in Escherichia coli and
S.cerevisiae (EP-A-258 067).

This plasmid was introduced into S.cerevisiae S150-2B (leu2-3 leu2-112 ura3-52 trp1-289 his3-Δ1) by
standard procedures (Beggs, 1978). Transformants were subsequently analysed and found to produce the
HSA-fibronectin fusion protein.

EXAMPLE 2 : HSA 1-195 FUSED TO Fn 585-1578

In this second example the first domain of human serum albumin (amino acids 1-195) is fused to amino 
acids 585-1578 of human fibronectin. 

The plasmid pDBD2 was digested with BamHI and BglII and the 0.79kb fragment was purified and then
ligated with BamHI-digested M13mp19 to form pDBDF6 (Fig. 6). The following oligonucleotide:

5'-CCAAAGCTCGAGGAACTTCG-3'

was used as a mutagenic primer to create an XhoI site in pDBDF6 by in vitro mutagenesis using a kit supplied
by Amersham International PLC. This site was created by changing base number 696 of HSA from a T to a G
(Fig. 2). The plasmid thus formed was designated pDBDF7 (Fig. 9). The following linker was then synthesised
to represent from this newly created XhoI site to the codon for lysine 195 of HSA (AAA) and then from the
codon for isoleucine 585 of fibronectin to the ends of oligonucleotides 1 and 8 shown in Fig. 6.

Linker 7

       D   E   L   R   D   E   G   K   A   S   S   A   K
5' TC GAT GAA CTT CGG GAT GAA GGG AAG GCT TCG TCT GCC AAA
3'      A CTT GAA GCC CTA CTT CCC TTC CGA AGC AGA CGG TTT

       I   T   E   T   P   S   Q   P   N   S   H
      ATC ACT GAG ACT CCG AGT CAG C                 3'
      TAG TGA CTC TGA GGC TCA GTC GGG TTG AGG GTG G 5'

This linker was ligated with the annealed oligonucleotides shown in Fig. 6, i.e. 2+7, 3+6 and 4+5 together
with XhoI and EcoRI digested pDBDF7 to form pDBDF8 (Fig. 9). Note that in order to recreate the original
HSA DNA sequence, and hence amino acid sequence, insertion of linker 7 and the other oligonucleotides into
pDBDF7 does not recreate the XhoI site.
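
The logic of the mutagenesis and its reversal can be sketched in a few lines. This is an illustration under stated assumptions, not the procedure itself: the wild-type sequence is inferred by reverting the single G of the primer's XhoI site to T, which agrees with the GAT codon carried by Linker 7, and the vector end is taken to be the primer sequence up to the XhoI cut position (C^TCGAG). The variable and function names are hypothetical.

```python
XHOI_SITE = "CTCGAG"

# Mutagenic primer as given in the text (5'->3').
PRIMER = "CCAAAGCTCGAGGAACTTCG"

# Assumed wild-type sequence: the last base of the XhoI site reverted G -> T.
site_start = PRIMER.index(XHOI_SITE)
mutated_pos = site_start + len(XHOI_SITE) - 1
WILD_TYPE = PRIMER[:mutated_pos] + "T" + PRIMER[mutated_pos + 1:]

# 5' end of the upper strand of Linker 7 (TC GAT GAA CTT CGG ...).
LINKER7_TOP_START = "TCGATGAACTTCGG"


def count_mismatches(a: str, b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(1 for x, y in zip(a, b) if x != y)


def ligate_into_xhoi_cut(mutant: str, insert_top: str) -> str:
    """Join the vector arm left of the XhoI cut (C^TCGAG) to the insert's upper strand."""
    cut = mutant.index(XHOI_SITE) + 1   # XhoI cuts after the first C
    return mutant[:cut] + insert_top


if __name__ == "__main__":
    print("mismatches primer vs wild type:", count_mismatches(PRIMER, WILD_TYPE))
    print("XhoI site in wild type:", XHOI_SITE in WILD_TYPE)
    print("XhoI site in mutant:", XHOI_SITE in PRIMER)
    junction = ligate_into_xhoi_cut(PRIMER, LINKER7_TOP_START)
    print("XhoI site after inserting Linker 7:", XHOI_SITE in junction)
```

Because the linker supplies GAT rather than GAG at the junction, the ligated product carries the original codon and the XhoI recognition sequence is lost, as the note above states.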

The 0.83kb BamHI-StuI fragment of pDBDF8 was purified and then was ligated with the 0.68kb
EcoRI-BamHI fragment of pDBDF2 and the 2.22kb StuI-EcoRI fragment of pFHDELI into BglII-digested pKV50 to
form pDBDF9 (Fig. 9). This plasmid is similar to pDBDF5 except that it specifies only residues 1-195 of HSA
rather than 1-387 as in pDBDF5.

When introduced into S.cerevisiae S150-2B as above, the plasmid directed the expression and secretion
of a hybrid molecule composed of residues 1-195 of HSA fused to residues 585-1578 of fibronectin.

EXAMPLE 3 : HSA 1-387 FUSED TO Fn 585-1578, AS CLEAVABLE MOLECULE 

In order to facilitate production of large amounts of residues 585-1578 of fibronectin, a construct was made
in which DNA encoding residues 1-387 of HSA was separated from DNA encoding residues 585-1578 of
fibronectin by the sequence

    I   E   G   R
   ATT GAA GGT AGA
   TAA CTT CCA TCT

which specifies the cleavage recognition site for the blood clotting Factor X. Consequently the purified secreted
product can be treated with Factor X and then the fibronectin part of the molecule can be separated from the
HSA part.
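
The effect of the cleavage site on the junction can be illustrated as follows. This is a deliberately simplified sketch: it assumes that cleavage occurs immediately after the arginine of the first Ile-Glu-Gly-Arg motif and ignores any other determinants of protease specificity. The function name is hypothetical, and the junction peptide is the stretch encoded by Linker 8 (HSA 382-387, the four-residue site, then fibronectin from residue 585).

```python
from typing import Optional, Tuple

FACTOR_X_SITE = "IEGR"


def cleave_after_site(protein: str,
                      site: str = FACTOR_X_SITE) -> Optional[Tuple[str, str]]:
    """Split a protein sequence immediately after the first occurrence of `site`.

    Returns (n_terminal_part, c_terminal_part), or None if the site is absent.
    """
    index = protein.find(site)
    if index == -1:
        return None
    cut = index + len(site)
    return protein[:cut], protein[cut:]


if __name__ == "__main__":
    junction = "EEPQNLIEGRITETPSQPNSH"   # peptide encoded by Linker 8
    hsa_part, fn_part = cleave_after_site(junction)
    print("HSA side:", hsa_part)
    print("Fn side: ", fn_part)
```

In this simplified picture the released fragment begins at isoleucine 585 with no residues of the site attached, since the four site residues remain on the HSA side.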

To do this two oligonucleotides were synthesised and then annealed to form Linker 8. 
Linker 8

      E   E   P   Q   N   L   I   E   G   R
5'   GAA GAG CCT CAG AAT TTA ATT GAA GGT AGA
3'   CTT CTC GGA GTC TTA AAT TAA CTT CCA TCT

      I   T   E   T   P   S   Q   P   N   S   H
     ATC ACT GAG ACT CCG AGT CAG C                 3'
     TAG TGA CTC TGA GGC TCA GTC GGG TTG AGG GTG G 5'

This linker was then ligated with the annealed oligonucleotides shown in Fig. 6, i.e. 2+7, 3+6 and 4+5 into
HincII and EcoRI digested mHOB12, to form pDBDF10 (Fig. 7). The plasmid was then digested with PstI and
EcoRI and the roughly 0.24kb fragment was purified and then ligated with the 1.29kb BamHI-PstI fragment of
pDBD2 and BamHI and EcoRI digested pUC19 to form pDBDF11 (Fig. 10).

The 1.5kb BamHI-StuI fragment of pDBDF11 was then ligated with the 0.68kb EcoRI-BamHI fragment of
pDBDF4 and the 2.22kb StuI-EcoRI fragment of pFHDELI into BglII-digested pKV50 to form pDBDF12 (Fig.
10). This plasmid was then introduced into S.cerevisiae S150-2B. The purified secreted fusion protein was
treated with Factor X to liberate the fibronectin fragment representing residues 585-1578 of the native
molecule.
Claims

Claims for the following Contracting States : AT, BE, CH, LI, DE, DK, FR, IT, LU, NL, SE

1. A fusion polypeptide comprising, as at least part of the N-terminal portion thereof, an N-terminal portion
of HSA or a variant thereof and, as at least part of the C-terminal portion thereof, another polypeptide
except that, when the said N-terminal portion of HSA is the 1-n portion where n is 369 to 419 or a variant
thereof then the said polypeptide is (a) the 585 to 1578 portion of human fibronectin or a variant thereof,
(b) the 1 to 368 portion of CD4 or a variant thereof, (c) platelet derived growth factor or a variant thereof,
(d) transforming growth factor β or a variant thereof, (e) the 1-261 portion of mature human plasma
fibronectin or a variant thereof, (f) the 278-578 portion of mature human plasma fibronectin or a variant
thereof, (g) the 1-272 portion of mature human von Willebrand's Factor or a variant thereof, or (h)
alpha-1-antitrypsin or a variant thereof.

2. A fusion polypeptide according to Claim 1 additionally comprising at least one N-terminal amino acid ex- 
tending beyond the portion corresponding to the N-terminal portion of HSA. 

3. A fusion polypeptide according to Claim 1 or 2 wherein there is a cleavable region at the junction of the
said N-terminal or C-terminal portions.

4. A fusion polypeptide according to any one of the preceding claims wherein the said C-terminal portion is 
the 585 to 1578 portion of human plasma fibronectin or a variant thereof. 

5. A transformed or transfected host having a nucleotide sequence so arranged as to express a fusion
polypeptide according to any one of the preceding claims.

6. A process for preparing a fusion polypeptide by cultivation of a host according to Claim 5, followed by sep- 
aration of the fusion polypeptide in a useful form. 

7. A fusion polypeptide according to any one of Claims 1 to 4 for use in therapy. 
Claims for the following Contracting States : ES, GR 

1. A process for preparing a fusion polypeptide by (i) cultivation of a transformed or transfected host having
a nucleotide sequence so arranged as to express a fusion polypeptide, followed by (ii) separation of the
fusion polypeptide in a useful form, characterised in that the fusion polypeptide comprises as at least part
of the N-terminal portion thereof, an N-terminal portion of HSA or a variant thereof and, as at least part
of the C-terminal portion thereof, another polypeptide except that, when the said N-terminal portion of
HSA is the 1-n portion where n is 369 to 419 or a variant thereof then the said polypeptide is (a) the 585
to 1578 portion of human fibronectin or a variant thereof, (b) the 1 to 368 portion of CD4 or a variant
thereof, (c) platelet derived growth factor or a variant thereof, (d) transforming growth factor β or a variant
thereof, (e) the 1-261 portion of mature human plasma fibronectin or a variant thereof, (f) the 278-578 portion
of mature human plasma fibronectin or a variant thereof, (g) the 1-272 portion of mature human von
Willebrand's Factor or a variant thereof, or (h) alpha-1-antitrypsin or a variant thereof.

2. A process according to Claim 1, wherein the fusion polypeptide additionally comprises at least one
N-terminal amino acid extending beyond the portion corresponding to the N-terminal portion of HSA.

3. A process according to Claim 1 or 2 wherein, in the fusion polypeptide, there is a cleavable region at the
junction of the said N-terminal or C-terminal portions.

4. A process according to any one of the preceding claims wherein the said C-terminal portion is the 585 
to 1578 portion of human plasma fibronectin or a variant thereof. 

Claims (translated from the German version)

Claims for the following Contracting States: AT, BE, CH, DE, DK, FR, IT, LU, NL, SE

1. Fusion polypeptide comprising, as at least part of its N-terminal portion, an N-terminal portion of HSA
or a variant thereof and, as at least part of its C-terminal portion, a further polypeptide, with the
exception that, when the N-terminal portion of HSA is the 1-n portion where n = 369 to 419 or a variant
thereof, the polypeptide consists of
(a) the 585 to 1578 portion of human fibronectin or a variant thereof,
(b) the 1 to 368 portion of CD4 or a variant thereof,
(c) platelet derived growth factor (PDGF) or a variant thereof,
(d) transforming growth factor β (TGF β) or a variant thereof,
(e) the 1-261 portion of mature human plasma fibronectin or a variant thereof,
(f) the 278-578 portion of mature human plasma fibronectin or a variant thereof,
(g) the 1-272 portion of mature human von Willebrand's Factor or a variant thereof, or
(h) alpha-1-antitrypsin or a variant thereof.

2. Fusion polypeptide according to Claim 1, additionally comprising at least one N-terminal amino acid
extending beyond the portion corresponding to the N-terminal portion of HSA.

3. Fusion polypeptide according to Claim 1 or 2, in which there is a cleavable region at the junction of the
N-terminal or C-terminal portions.

4. Fusion polypeptide according to any one of the preceding claims, wherein the C-terminal portion consists
of the 585 to 1578 portion of human plasma fibronectin or a variant thereof.

5. Transformed or transfected host having a nucleotide sequence so arranged that it can express a fusion
polypeptide according to any one of the preceding claims.

6. Process for preparing a fusion polypeptide by cultivating a host according to Claim 5 and subsequently
separating the fusion polypeptide in a suitable form.

7. Fusion polypeptide according to any one of Claims 1 to 4 for therapeutic use.

Claims for the following Contracting States: ES, GR

1. Process for preparing a fusion polypeptide by
(i) cultivating a transformed or transfected host having a nucleotide sequence so arranged that it
expresses a fusion polypeptide, and
(ii) subsequently separating the fusion polypeptide in a suitable form,
characterised in that the fusion polypeptide comprises, as at least part of its N-terminal portion, an
N-terminal portion of HSA or a variant thereof and, as at least part of its C-terminal portion, a further
polypeptide, with the exception that, when the N-terminal portion of HSA is the 1-n portion where
n = 369 to 419 or a variant thereof, the polypeptide consists of
(a) the 585-1578 portion of human fibronectin or a variant thereof,
(b) the 1-368 portion of CD4 or a variant thereof,
(c) platelet derived growth factor or a variant thereof,
(d) transforming growth factor β or a variant thereof,
(e) the 1-261 portion of mature human plasma fibronectin or a variant thereof,
(f) the 278-578 portion of mature human plasma fibronectin or a variant thereof,
(g) the 1-272 portion of mature human von Willebrand's Factor or a variant thereof, or
(h) α-1-antitrypsin or a variant thereof.

2. Process according to Claim 1, wherein the fusion polypeptide additionally comprises at least one
N-terminal amino acid extending beyond the portion corresponding to the N-terminal portion of HSA.

3. Process according to Claim 1 or 2, wherein, in the fusion polypeptide, there is a cleavable region at the
junction of the N-terminal or C-terminal portions.

4. Process according to any one of the preceding claims, wherein the C-terminal portion consists of the
585-1578 portion of human plasma fibronectin or a variant thereof.



Claims (translated from the French version)

Claims for the following Contracting States: AT, BE, CH, DE, DK, FR, IT, LU, NL, SE

1. Fusion polypeptide comprising, as at least part of its N-terminal portion, an N-terminal portion of HSA
or of a variant thereof and, as at least part of its C-terminal portion, another polypeptide, except that,
when this N-terminal portion of HSA is the 1-n portion in which n is 369 to 419 or a variant thereof, this
polypeptide is (a) the 585 to 1578 portion of human fibronectin or a variant thereof, (b) the 1 to 368
portion of CD4 or a variant thereof, (c) platelet derived growth factor or a variant thereof,
(d) transforming growth factor β or a variant thereof, (e) the 1-261 portion of mature human plasma
fibronectin or a variant thereof, (f) the 278-578 portion of mature human plasma fibronectin or a variant
thereof, (g) the 1-272 portion of mature human von Willebrand factor or a variant thereof, or
(h) alpha-1-antitrypsin or a variant thereof.

2. Fusion polypeptide according to Claim 1, additionally comprising at least one N-terminal amino acid
extending beyond the portion corresponding to the N-terminal portion of HSA.

3. Fusion polypeptide according to Claim 1 or 2, in which there is a cleavable region at the junction of
these N-terminal and C-terminal portions.

4. Fusion polypeptide according to any one of the preceding claims, in which this C-terminal portion is the
585 to 1578 portion of human plasma fibronectin or a variant thereof.

5. Transformed or transfected host having a nucleotide sequence so arranged as to express a fusion
polypeptide according to any one of the preceding claims.

6. Process for preparing a fusion polypeptide by culture of a host according to Claim 5, followed by
separation of the fusion polypeptide in a useful form.

7. Fusion polypeptide according to any one of Claims 1 to 4, usable in therapy.

Claims for the following Contracting States: ES, GR

1. Process for preparing a fusion polypeptide by (i) culture of a transformed or transfected host having a
nucleotide sequence so arranged as to express a fusion polypeptide, followed by (ii) separation of the
fusion polypeptide in a useful form, characterised in that the fusion polypeptide comprises, as at least
part of its N-terminal portion, an N-terminal portion of HSA or of a variant thereof and, as at least part
of its C-terminal portion, another polypeptide, except that, when this N-terminal portion of HSA is the
1-n portion in which n is 369 to 419 or a variant thereof, this polypeptide is then (a) the 585 to 1578
portion of human fibronectin or a variant thereof, (b) the 1 to 368 portion of CD4 or a variant thereof,
(c) platelet derived growth factor or a variant thereof, (d) transforming growth factor β or a variant
thereof, (e) the 1-261 portion of mature human plasma fibronectin or a variant thereof, (f) the 278-578
portion of mature human plasma fibronectin or a variant thereof, (g) the 1-272 portion of mature human
von Willebrand factor or a variant thereof, or (h) alpha-1-antitrypsin or a variant thereof.

2. Process according to Claim 1, in which the fusion polypeptide additionally comprises at least one
N-terminal amino acid extending beyond the portion corresponding to the N-terminal portion of HSA.

3. Process according to Claim 1 or 2 in which, in the fusion polypeptide, there is a cleavable region at the
junction of these N-terminal and C-terminal portions.

4. Process according to any one of the preceding claims, in which this C-terminal portion is the 585 to 1578
portion of human plasma fibronectin or a variant thereof.

FIGURE 1

                                     10                                      20
Asp Ala His Lys Ser Glu Val Ala His Arg Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys

                                     30                                      40
Ala Leu Val Leu Ile Ala Phe Ala Gln Tyr Leu Gln Gln Cys Pro Phe Glu Asp His Val

                                     50                                      60
Lys Leu Val Asn Glu Val Thr Glu Phe Ala Lys Thr Cys Val Ala Asp Glu Ser Ala Glu

                                     70                                      80
Asn Cys Asp Lys Ser Leu His Thr Leu Phe Gly Asp Lys Leu Cys Thr Val Ala Thr Leu

                                     90                                     100
Arg Glu Thr Tyr Gly Glu Met Ala Asp Cys Cys Ala Lys Gln Glu Pro Glu Arg Asn Glu

                                    110                                     120
Cys Phe Leu Gln His Lys Asp Asp Asn Pro Asn Leu Pro Arg Leu Val Arg Pro Glu Val

                                    130                                     140
Asp Val Met Cys Thr Ala Phe His Asp Asn Glu Glu Thr Phe Leu Lys Lys Tyr Leu Tyr

                                    150                                     160
Glu Ile Ala Arg Arg His Pro Tyr Phe Tyr Ala Pro Glu Leu Leu Phe Phe Ala Lys Arg

                                    170                                     180
Tyr Lys Ala Ala Phe Thr Glu Cys Cys Gln Ala Ala Asp Lys Ala Ala Cys Leu Leu Pro

                                    190                                     200
Lys Leu Asp Glu Leu Arg Asp Glu Gly Lys Ala Ser Ser Ala Lys Gln Arg Leu Lys Cys

                                    210                                     220
Ala Ser Leu Gln Lys Phe Gly Glu Arg Ala Phe Lys Ala Trp Ala Val Ala Arg Leu Ser

                                    230                                     240
Gln Arg Phe Pro Lys Ala Glu Phe Ala Glu Val Ser Lys Leu Val Thr Asp Leu Thr Lys

                                    250                                     260
Val His Thr Glu Cys Cys His Gly Asp Leu Leu Glu Cys Ala Asp Asp Arg Ala Asp Leu

                                    270                                     280
Ala Lys Tyr Ile Cys Glu Asn Gln Asp Ser Ile Ser Ser Lys Leu Lys Glu Cys Cys Glu

                                    290                                     300
Lys Pro Leu Leu Glu Lys Ser His Cys Ile Ala Glu Val Glu Asn Asp Glu Met Pro Ala

                                    310                                     320
Asp Leu Pro Ser Leu Ala Ala Asp Phe Val Glu Ser Lys Asp Val Cys Lys Asn Tyr Ala

                                    330                                     340
Glu Ala Lys Asp Val Phe Leu Gly Met Phe Leu Tyr Glu Tyr Ala Arg Arg His Pro Asp

                                    350                                     360
Tyr Ser Val Val Leu Leu Leu Arg Leu Ala Lys Thr Tyr Glu Thr Thr Leu Glu Lys Cys

                                    370                                     380
Cys Ala Ala Ala Asp Pro His Glu Cys Tyr Ala Lys Val Phe Asp Glu Phe Lys Pro Leu

FIGURE 1 Cont.

                                    390                                     400
Val Glu Glu Pro Gln Asn Leu Ile Lys Gln Asn Cys Glu Leu Phe Glu Gln Leu Gly Glu

                                    410                                     420
Tyr Lys Phe Gln Asn Ala Leu Leu Val Arg Tyr Thr Lys Lys Val Pro Gln Val Ser Thr

                                    430                                     440
Pro Thr Leu Val Glu Val Ser Arg Asn Leu Gly Lys Val Gly Ser Lys Cys Cys Lys His

                                    450                                     460
Pro Glu Ala Lys Arg Met Pro Cys Ala Glu Asp Tyr Leu Ser Val Val Leu Asn Gln Leu

                                    470                                     480
Cys Val Leu His Glu Lys Thr Pro Val Ser Asp Arg Val Thr Lys Cys Cys Thr Glu Ser

                                    490                                     500
Leu Val Asn Arg Arg Pro Cys Phe Ser Ala Leu Glu Val Asp Glu Thr Tyr Val Pro Lys

                                    510                                     520
Glu Phe Asn Ala Glu Thr Phe Thr Phe His Ala Asp Ile Cys Thr Leu Ser Glu Lys Glu

                                    530                                     540
Arg Gln Ile Lys Lys Gln Thr Ala Leu Val Glu Leu Val Lys His Lys Pro Lys Ala Thr

                                    550                                     560
Lys Glu Gln Leu Lys Ala Val Met Asp Asp Phe Ala Ala Phe Val Glu Lys Cys Cys Lys

                                    570                                     580
Ala Asp Asp Lys Glu Thr Cys Phe Ala Glu Glu Gly Lys Lys Leu Val Ala Ala Ser Gln

Ala Ala Leu Gly Leu
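
As a reading aid for Figure 1, the three-letter residue codes can be converted to the one-letter codes that appear beneath the DNA sequence in Figure 2. The following is a minimal Python sketch and is not part of the specification: the names THREE_TO_ONE, to_one_letter and FIGURE_1_ROW_1 are illustrative assumptions, and only the first row of the figure (residues 1 to 20) is used.

```python
# Standard three-letter to one-letter amino acid codes (the 20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(row: str) -> str:
    """Convert a whitespace-separated row of three-letter codes to one-letter codes."""
    return "".join(THREE_TO_ONE[residue] for residue in row.split())


# Residues 1 to 20 of mature HSA, as given in the first row of Figure 1.
FIGURE_1_ROW_1 = (
    "Asp Ala His Lys Ser Glu Val Ala His Arg "
    "Phe Lys Asp Leu Gly Glu Glu Asn Phe Lys"
)

print(to_one_letter(FIGURE_1_ROW_1))
```

A residue code outside the 20 listed raises a KeyError, which is a convenient way to spot a misread entry when transcribing the figure.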

FIGURE 2 DNA sequence coding for mature HSA

        10        20        30        40        50        60        70        80
GATGCACACAAGAGTGAGGTTGCTCATCGGTTTAAAGATTTGGGAGAAGAAAATTTCAAAGCCTTGGTGTTGATTGCCTT
DAHKSEVAHRFKDLGEENFKALVLIAF

        90       100       110       120       130       140       150       160
TGCTCAGTATCTTCAGCAGTGTCCATTTGAAGATCATGTAAAATTAGTGAATGAAGTAACTGAATTTGCAAAAACATGTG
AQYLQQCPFEDHVKLVNEVTEFAKTC

       170       180       190       200       210       220       230       240
TTGCTGATGAGTCAGCTGAAAATTGTGACAAATCACTTCATACCCTTTTTGGAGACAAATTATGCACAGTTGCAACTCTT
VADESAENCDKSLHTLFGDKLCTVATL

       250       260       270       280       290       300       310       320
CGTGAAACCTATGGTGAAATGGCTGACTGCTGTGCAAAACAAGAACCTGAGAGAAATGAATGCTTCTTGCAACA...
RETYGEMADCCAKQEPERNECFLQHKD

       330       340       350       360       370       380       390       400
TGACAACCCAAACCTCCCCCGATTGGTGAGACCAGAGGTTGATGTGATGTGCACTGCTTTTCATGACAATGAAGAGACAT
DNPNLPRLVRPEVDVMCTAFHDNEET

       410       420       430       440       450       460       470       480
TTTTGAAAAAATACTTATATGAAATTGCCAGAA...
FLKKYLYEIARRHPYFYAPELLFFAKR

       490       500       510       520       530       540       550       560
TATAAAGCTGCTTTTACAGAATGTTGCCAAGCTGCTGATAAAGCTGC...
YKAAFTECCQAADKAACLLPKLDELRD

       570       580       590       600       610       620       630       640
TGAAGGGAAGGCTTCGTCTGCCAAACAGAGACTCAAGTGTGCCAGT...
EGKASSAKQRLKCASLQKFGERAFKA

       650       660       670       680       690       700       710       720

gggcagtgc<:tcgcctgagccagagatttcccaaagctgagtc^ 

WAVARLSQRFPKAEFAEVSKLVTDLTK

       730       740       750       760       770       780       790       800
GTCCACACGGAATGCTGCCATGGAGATCTGCTTGAATGTGCTGATGACAGGGCGGACCTTGCCAAGTATATCTGTGAAAA
VHTECCHGDLLECADDRADLAKYICEN

       810       820       830       840       850       860       870       880
TCAGGATTCGATCTCCAGTAAACTGAAGGAATGCTGTGAAAAACCTCTGTTGGAAAAATCCCACTGCATTGCCGAAGTGG
QDSISSKLKECCEKPLLEKSHCIAEV

       890       900       910       920       930       940       950       960
AAAATGATGAGATGCCTGCTGACTTGCCTTCATTAGCTGCTGATTTTGTTGAAAGTAAGGATGTTTGCAAAAACTATGCT
ENDEMPADLPSLAADFVESKDVCKNYA

       970       980       990      1000      1010      1020      1030      1040
GAGGCAAAGGATGTCTTCCTGGGCATGTTTTTGTATGAATATGCAAGAAGGCATCCTGATTACTCTGTCGTGCTGCTGCT
EAKDVFLGMFLYEYARRHPDYSVVLLL

FIGURE 2 Cont.

      1050      1060      1070      1080      1090      1100      1110      1120
GAGACTTGCCAAGACATATGAAACCACTCTAGAGAAGTGCTGTGCCGCTGCAGATCCTCATGAATGCTATGCCAAAGTGT
RLAKTYETTLEKCCAAADPHECYAKV

      1130      1140      1150      1160      1170      1180      1190      1200
TCGATGAATTTAAACCTCTTGTGGAAGAGCCTCAGAATTTAATCAAACAAAACTGTGAGCTTTTTGAGCAGCTTGGAGAG
FDEFKPLVEEPQNLIKQNCELFEQLGE

      1210      1220      1230      1240      1250      1260      1270      1280
TACAAATTCCAGAATGCCCTATTAGTTCGTTACACCAAGAAAGTACCCCAAGTGTCAACTCCAACTCTTGTAGAGGTCTC
YKFQNALLVRYTKKVPQVSTPTLVEVS

      1290      1300      1310      1320      1330      1340      1350      1360
AAGAAACCTAGGAAAAGTGGGCAGCAAATGTTGTAAACATCCT...
RNLGKVGSKCCKHPEAKRMPCAEDYL

      1370      1380      1390      1400      1410      1420      1430      1440
CCGTGGTCCTGAACCAGTTATGTGTGTTGCATGAGAAAACGCCAGTAAGTGACAGAGTCACAAAATGCTGCACAGAGTCC
SVVLNQLCVLHEKTPVSDRVTKCCTES

      1450      1460      1470      1480      1490      1500      1510      1520
TTGGTGAACAGGCGACCATGCTTTTCAGCTCTGGAAGTCGATGAAACATACGTTCCCAAAGAGTTTAATGCTGAAACATT
LVNRRPCFSALEVDETYVPKEFNAETF

      1530      1540      1550      1560      1570      1580      1590      1600
CACCTTCCATGCAGATATATGCACACTTTCTGAGAAGGAGAGACAAATCAAGAAACAAACTGCACTTGTTGAGCTTGTGA
TFHADICTLSEKERQIKKQTALVELV

      1610      1620      1630      1640      1650      1660      1670      1680
AACACAAGCCCAAGGCAACAAAAGAGCAACTGAAAGCTGTTATGGATGATTTCGCAGCTTTTGTAGAGAAGTGCTGCAAG
KHKPKATKEQLKAVMDDFAAFVEKCCK

      1690      1700      1710      1720      1730      1740      1750      1760
GCTGACGATAAGGAGACCTGCTTTGCCGAGGAGGGTAAAAAACTTGTTGCTGCAAGTCAAGCTGCCTTAGGCTTATAACA
ADDKETCFAEEGKKLVAASQAALGL

      1770      1780
TCTACATTTAAAAGCATCTCAG
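
The one-letter translation printed beneath each nucleotide row of Figure 2 can be checked against the DNA using the standard genetic code. The following is a minimal Python sketch and is not part of the specification: the names CODON_TABLE, translate and ROW_1 are illustrative assumptions, and ROW_1 is the first 80-nucleotide row as read from the figure, with characters misread in reproduction (for example 7 for T) assumed corrected.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G ordering
# ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Nucleotides 1 to 80 of Figure 2 (26 complete codons plus two bases of codon 27).
ROW_1 = (
    "GATGCACACAAGAGTGAGGTTGCTCATCGGTTTAAAGATT"
    "TGGGAGAAGAAAATTTCAAAGCCTTGGTGTTGATTGCCTT"
)

print(translate(ROW_1))
```

The 26 residues obtained agree with the start of Figure 1 (Asp Ala His Lys Ser Glu Val ...); the figure's first translation row carries one further letter because the codon for residue 27 is completed on the next nucleotide row.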

GAAGAGCCTCAGAATTTAATCACTGAGACTCCGAGTCAGCCCAACTCCCACCCCATCCAGTGG
CTTCTCGGAGTCTTAAATTAGTGACTCTGAGGCTCAGTCGGGTTGAGGGTGGGGTAGGTCACC
 e  e  p  q  n  l  i  t  e  t  p  s  q  p  n  s  h  p  i  q  w

AATGCACCACAGCCATCTCACATTTCCAAGTACATTCTCAGGTGGAGACCTAAAAATTCTGTA
TTACGTGGTGTCGGTAGAGTGTAAAGGTTCATGTAAGAGTCCACCTCTGGATTTTTAAGACAT
 n  a  p  q  p  s  h  i  s  k  y  i  l  r  w  r  p  k  n  s  v

GGCCGTTGGAAGGAAGCTACCATACCAGGCCACTTAAACTCCTACACCATCAAAGGCCTG
CCGGCAACCTTCCTTCGATGGTATGGTCCGGTGAATTTGAGGATGTGGTAGTTTCCGGACTTAA
 g  r  w  k  e  a  t  i  p  g  h  l  n  s  y  t  i  k  g  l

Figure 6 Linker 5 showing the eight constituent oligonucleotides
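
The two strands of the linker in Figure 6 are written one above the other, so each base of the lower strand should be the Watson-Crick partner of the base directly above it. The following is a minimal Python sketch of that check and is not part of the specification: the names complement, reverse_complement, TOP_ROW_1 and BOTTOM_ROW_1 are illustrative assumptions, and the two strings are the first row of the figure as read from it.

```python
# Watson-Crick pairing for DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(strand: str) -> str:
    """Return the base-by-base complement, in the same left-to-right order."""
    return strand.upper().translate(COMPLEMENT)


def reverse_complement(strand: str) -> str:
    """Return the complementary strand written in its own 5' to 3' direction."""
    return complement(strand)[::-1]


# First row of the linker: upper strand and the strand printed beneath it.
TOP_ROW_1 = "GAAGAGCCTCAGAATTTAATCACTGAGACTCCGAGTCAGCCCAACTCCCACCCCATCCAGTGG"
BOTTOM_ROW_1 = "CTTCTCGGAGTCTTAAATTAGTGACTCTGAGGCTCAGTCGGGTTGAGGGTGGGGTAGGTCACC"

print(complement(TOP_ROW_1) == BOTTOM_ROW_1)
```

Because the lower strand is printed left to right under the upper one, the plain complement is the right comparison here; reverse_complement gives the same strand in the orientation in which an oligonucleotide would normally be written.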

Fig. 7 Construction of pDBDF2

Fig. 8 Construction of pDBDF5

PstI-EcoRI fragment

BamHI+EcoRI digested
pUC19

BglII-digested
pKV50

B=BamHI, Bg=BglII,
E=EcoRI, Hc=HincII,
P=PstI, St=StuI

Fig. 10 Construction of pDBDF12

Figure 11

Name: pFHDEL1

Vector: pUC18 Amp^r 2860bp

Insert: hFN cDNA - 7630bp